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Abstract 

We introduce the convex combinatorial optimization problem, a far reaching generaliza- 
tion of the standard linear combinatorial optimization problem. We show that it is strongly 
polynomial time solvable over any edge-guaranteed family, and discuss several applications. 

1 Introduction 

The general linear combinatorial optimization problem is the following. 

Linear Combinatorial optimization. Given a family T C 2 N of subsets of N := {1, . . . ,n) 
and a rational weighting w : N — ► Q, find F £ T of maximum weight w(F) := X^gF W U)- 

There is a massive body of knowledge on the computational complexity of this problem for various 
classes of families presented in various ways (in terms of n and sometimes additional parameters), 
and efficient algorithms in numerous cases, cf. [13]. F° r instance, if J- is the family of stable sets 
in a given graph with vertex set N then the problem is NP-hard whereas if J- is the family of 
matchings in a given graph with edge set N then the problem is polynomial time solvable. 

In this article we consider the following generalization of linear combinatorial optimization. 

Convex combinatorial optimization. Given T Q 2 N with N = {1, . . . ,n}, & vectorial weight- 
ing w : N — > Q d , and a convex functional c : Q d — ► Q, find F G T of maximum value c(w(F)). 

The standard linear combinatorial optimization problem over a family T is recovered as the 
special case with d = 1, uu : N — * Q weighting by scalars, and c : Q — ► Q : x i— > x the identity. 

Convex combinatorial optimization has a very broad expressive power and conveniently cap- 
tures a variety of problems studied in the operations research and mathematical programming 
literature including quadratic assignment, inventory management, scheduling, reliability, bar- 
gaining games, clustering, and vector partitioning, see HJ 13 IH3 112 [22} [23 02] and references 
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therein. In Section 3 we discuss some of these applications in detail and demonstrate that, as a 
consequence of our framework, all admit a simple unified strongly polynomial time algorithm. 

A particularly successful general methodology for linear combinatorial optimization is the geo- 
metric approach inaugurated by Edmonds 17 and culminated in Grotschel-Lovasz-Schrijver |25j . 
outlined as follows. With each family member F G J- is associated its indicator lp := YljeF ^-j 
with lj the jth standard unit vector in Q n , and with the family is associated the polytope 

V T : = conv{l F :Fe^} C Q" . 

Extending w to Q n by w(x) := Y2j=i w U) x ji the problem reduces to maximizing the linear func- 
tional w over . This leads to the study of facets of \ when these can be suitably controlled, 
the problem is polynomial time solvable via the ellipsoid method 33 for linear programming. 

In this article we show that the geometric approach can be usefully exploited so as to yield 
a widely applicable general methodology for convex combinatorial optimization as well. Our 
framework leads to the study of edge- directions of \ when these can be suitably controlled, the 
problem is efficiently solvable via zonotope (or hyperplane arrangement) methods as follows. 

Theorem 1.1 For any fixed d, there is a strongly polynomial oracle time algorithm solving convex 
combinatorial optimization over any edge- guaranteed family presented by a membership oracle. 

The precise definition of an edge-guaranteed family will be given in Section 2: all families un- 
derlying the various applications in Section 3 naturally possess this property. The assumption of 
fixed d is also natural and necessary: already for d = 1, the problem generalizes linear combina- 
torial optimization which is frequently intractable; and when d is variable, the problem captures 
NP-hard instances even for the simple power set family T = 2 N , see Example 13. II below. 

The main part of the proof of this theorem is a reduction of the convex combinatorial optimiza- 
tion problem over a family T to the solution of polynomially many standard linear combinatorial 
optimization counterparts over the same family T. The reduction makes use of several results 
about zonotopes which are available in combinatorial and computational geometry, but have not 
been so far integrated and harnessed in a systematic way to discrete optimization. 

The repeated solution of each of the standard linear combinatorial optimization counterparts 
can be done following either one of the following two approaches. The first is to use any efficient 
ad-hoc algorithm available in the literature for the specific family T at hand: this typically 
leads to the best overall running time. This approach indeed applies to all of the applications 
discussed in Section 3, since each admits a very fast ad-hoc algorithm (ranging from simple 
greedy to sophisticated min-cost flow). The second approach, which is generic and works for any 
J 7 , takes advantage of the fact that a test set (cf. [10]) is readily available for any edge-guaranteed 
family, and (see |44| and references therein), using scaling |19j and Diophantine approximation 
20 , allows the efficient solution of the necessary linear optimization counterparts. 

The article is organized as follows. In Section 2 we prove Theorem II .11 as well as some other 
results, and discuss some relevant issues, as follows. In §2.1 we discuss the necessary preliminaries 
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on zonotopes and edge-directions. In §2.2 we prove Theorem 12 . 61 providing the reduction of con- 
vex combinatorial optimization to polynomially many standard linear combinatorial optimization 
counterparts. In §2.3 we discuss the generic approach for solving the counterparts and combine 
it with Theorem 12.61 to establish Theorem 11.11 In §2.4 we discuss the problem of finding short 
monotone paths on (0, l)-polytopes, provide Lemma I2.1UI which is a certain (0, l)-analog of the 
Klee-Minty cube [HE], and raise some questions. In §2.5 we consider classes of edge-guaranteed 
families and conclude Corollary 12 . 121 concerning such edge-well-behaved classes (defined therein). 
In §2.6 we discuss projection representation which sometimes helps control edge-directions. Sec- 
tion 3 is devoted to applications: in §3.1 we discuss quadratic assignment and matroids, whereas 
in §3.2 we make use of projection representation and discuss in detail the broadly applicable shape 
vector partitioning problem. The last Section 4 contains some final remarks and open problems. 

We conclude this introduction with some comments. First, our results make use of and provide 
an efficient enumeration of the vertices of the polytope := coiw{w(F) : F € J-} which is a 
projection of V^; as the maximum of a convex functional c over is attained at a vertex and 
each vertex has the form w(F) with F € this provides a strategy for addressing the convex 
combinatorial optimization problem. One of the difficulties we overcome is that the number of 
sets in T is typically exponential in n and hence it is generally impossible to construct directly 
efficiently. As a consequence of our efficient vertex enumeration of , our results immediately 
extend to the larger class of problems where c is any functional which is guaranteed to attain a 
maximum over at a vertex, e.g., when c is (edge -)quasi-convex on 7^^*, see |30j . In particular, 
our results extend to functions c which are (asymmetric) Schur convex when the edge-directions 
of are differences of standard unit vectors in Q d , again see (30] - Also, the results can be 
generalized to some extent from (0, l)-problems to integer programming. Second, we note that 
in studying edge-directions, we make use of projections of polytopes; thus the Billera-Sturmfels 
theory of fiber polytopes jOj, related to various aspects of polytope projection, may be helpful in 
the classification of edge-well-behaved classes of families. Also, some new questions that we raise 
about graphs of (0, l)-polytopes might be addressed through the Billera-Sarangarajan universal 
embedding of such polytopes as travelling salesman polytopes 8 . 

We hope this exposition will make our framework a widely accessible tool in the arsenal of 
discrete optimization, and will stimulate the study of edge-directions of polytopes for various 
combinatorially defined families. Since convex combinatorial optimization is often intractable, 
there is also much room for the study of approximation algorithms for this problem for various 
families and we hope this article will stimulate research on this yet unexplored ground. 



2 Edge-Directions and the Algorithmic Solution 

2.1 Edge-directions and zonotopes 

We start by introducing the necessary terminology and collecting several facts that we shall make 
use of; for some we only provide a reference and for others we provide a short proof. 
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The zonotope generated by a set of vectors E = {e 1 , . . . , e m } in Q is the Minkowsky sum 

m ( m \ ( m \ 

Z = zone(£) := [-e*, A = \ V* : -1 < Aj < 1 \ = conv I A^ : A, = ±1 \ C Q d . 

i=l li=l J U=l J 

The following bound on the number of vertices of zonotopes has been rediscovered many times over 
the years; see e.g. I28j for early references and |231 147] for recent extensions and refinements. 



Lemma 2.1 The number of vertices of any d- dimensional zonotope generated by m vectors is at 
most 2^2iZo ( r "i~ 1 ) • Thus, for fixed d it is 0(m d_1 ) and hence polynomially bounded in m. 

Each vector a E Q d is also interpreted as the linear functional on Q d given via the standard 
inner product a ■ x = Y2i=i a i x i- The normal cone of a polytope V at its face F is the (relatively 
open) cone Cp of those linear functionals a which are maximized over V precisely at points of 
F. The following computational analogue of Lemma l2.1l is provided by the algorithm in |15[ HHj 
(the latter reference provides a necessary correction of the former); some extensions are again in 

EO 

Lemma 2.2 Fix any d. Then all vertices of any d- dimensional zonotope Z generated by m given 
vectors can be listed, each vertex u along with a linear functional a{u) € uniquely maximized 
over Z at u, in strongly polynomial time using 0{m d ~ 1 ) arithmetic operations. 

Note that throughout we are mainly interested in strongly polynomial (oracle) time algorithms, 
that is, algorithms that perform a polynomial number of arithmetic operations (and calls to the 
relevant oracles if any) and are also polynomial time in the Turing computation model. 

The collection of normal cones of a polytope V at all faces is called the normal fan of V (see 
|26|). A polytope Z is a refinement of a polytope V if the closure of each normal cone of V is the 
union of closures of normal cones of Z. A standard result shows that Z is a refinement of V if 
and only if the normal cone of every vertex of Z is contained in the normal cone of some vertex 
of V, and we will use this property interchangeably with the above definition of refinement. A 
direction of an edge [u, v] of a polytope V is any nonzero scalar multiple of v — u. We provide a 
simple proof of the following fact (cf. |2Sj) which is quite central to our considerations. 

Lemma 2.3 Let V be a polytope and let E be a finite set of vectors containing a direction of 
every edge of V . Then the zonotope Z := zone(E) generated by E is a refinement ofV. 

Proof. Let E = {z 1 , . . . , z m }. Consider any vertex u of Z. Then u = Yli=i ^i z i f° r some 
Aj = ±1 and hence its normal cone consists of those a satisfying a-X{Zi > for all i. Let v be a 
vertex of V at which some such a (belonging to C^) is maximized over V . Consider any edge [v, w] 
of V . Then v — w = ctiZi for some scalar «j ^ and some Z{, and < a- (v — w) = a-ctiZi, implying 
a-iXi > 0. It follows that every a in the cone of the vertex u of Z satisfies a ■ (v — w) > for 
every edge of V containing v and therefore a is also in the cone Op of the vertex v of V, and hence 
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Cz Q Cp. Since u was arbitrary, it follows that the normal cone of every vertex of Z is contained 
in the normal cone of some vertex of V and we are done by the aforementioned standard result. □ 

Finally, we need the following statement about edge-directions of linear images of polytopes. 

Lemma 2.4 Let Q := uj(V) be the image of a polytope V under a linear map uj. Then every 
direction q of an edge of Q is the image under uj of some direction p of an edge of V . 

Proof. Let q be a direction of an edge [x,y] of Q. Consider the face F := cj _1 ([:c,y]) of 
V. Let V be the set of vertices of F and let U = {u G V : ui{u) = x }; as x ^ y, U ^ V. 
Further, as the graph of F is connected there must be an edge [u, v] of F, and hence of V, for 
some u G U and v G V \ U. Then u(v) G (x, y] hence uj(v) = x + aq for some Therefore 
q = —(uj(v) —uj(u)) = uj(—(v — u)) = uj(p) with p := —(v — u), a direction of the edge [u, v] of V. □ 



2.2 Reduction of convex to linear combinatorial optimization 

We now reduce convex to linear combinatorial optimization. We make the following assumptions. 
The ground set is N := {1, . . . , n} and the family T C 2 N is edge- guaranteed, which means that 
it is nonempty and comes with an explicit set E = {e 1 , . . . , e m } C Q n of vectors guaranteed to 
contain a direction of each edge of the polytope = conv{ lp : F G J-} associated with T. In 
this subsection we assume that J- is presented by a linear combinatorial optimization oracle that, 
given b : N — * Q, returns a family member F G T of maximum weight b(F) = X^jeF Ki)- The 
convex functional c : Q d — ► Q is presented by an evaluation oracle that, given x G Q d , returns 
the value c(x). The weighting w : N — ► Q d is given by an explicit list w(l), . . . ,w(n) G Q d . 
We consider d as fixed; otherwise, as mentioned before, the problem becomes intractable at once 
even for the simple family T = 2 N , see Example 13. II below. The following algorithm, applied to 
the data above, provides a reduction of convex to linear combinatorial optimization. 

Algorithm 2.5 Given data as above, perform the following steps: 

1. Consider the linear map ui : Q n — > Q d defined by ui(x) := Y^j=i w U) x j • 

(a) Compute the image uj(E) = {ui{e l ), . . . , w(e m )} of E under u. 

2. Consider the zonotope Z := zoiae(uj(E)) = ^™ x[— w(e*), u;(e 4 )] in d-space Q d . 

(a) Compute the list {u l , . . . , u k } of all vertices of Z. 

(b) For each u l compute a linear functional a 1 G Cj? in the normal cone of Z at vt. 

3. (a) For each a 1 compute b l : N — > Q defined by b l (j) := a 1 ■ w(j) = J2t=i a t w U)t ■ 

(b) For each ¥ query the oracle of T and obtain F % G T of maximum weight b l (F l ). 

(c) For each F l query the oracle of c and obtain the value c{w{F' 1 )) = c(^ Jgi?i w(J)). 
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4. Output F l G T of maximum value c(w(F 1 )) among F 1 , . . . , .F fc . 

Theorem 2.6 Alaorithm \2.5\ solves the convex combinatorial optimization problem with data as 
above in strongly polynomial oracle time using 0(nm d_1 ) arithmetic operations and 0{m d ~ 1 ) 
queries of the linear combinatorial optimization oracle of T and the evaluation oracle of c. 

Proof. First we justify the algorithm. Recall the polytope V£ = conv{ w(F) : F G J-}. As 

= cotw{w(F) : FeJ 7 } = conv{a;(l F ) : F G F} = u{V r ) , 

is the image of under the linear map to defined in step 1 of the algorithm. Thus, by 
Lemma 12.41 the image oj(E) of E under oj contains a direction of every edge of V^. Therefore, 
by Lemma 12.31 the zonotope 

m 

Z := zone(w(£)) = J^[-w(e*), w(e*)] 

i=l 

defined in step 2 is a refinement of V^j . Now consider any vertex v of V^. Since Z refines V^j , 
the normal cone of V^j at v contains the normal cone of Z at some vertex u l of Z found in step 
2a. This implies that the corresponding linear functional a 1 G (J^ found in step 2b is maximized 
uniquely over at v. Now, consider the corresponding weighting b l defined in step 3a. As v is 
the unique maximizer of a 1 over V^j = conv {w(F) : F G J 7 }, we have 

b l (F) = ^tf-wU) = a l -^w{j) = (j-w(F) < J-v 

for each F G with equality if and only if w(F) = v. Thus, the member F l G T obtained in step 
3b from the linear combinatorial optimization oracle of T when maximizing b % has v = w{F l ). It 
follows that every vertex of equals w{F l ) for some F l obtained in step 3b. Since c is convex, 
the maximum value c(w(F)) of F G J- occurs at some vertex v = w(F l ) of V^j- Thus the member 
F l G J- output by the algorithm in step 4, which has maximum value c(w(F 1 )) among the values 
computed in step 3c, is an optimal solution to the convex combinatorial optimization problem. 

Next we verify the claimed complexity, where, as explained, d is considered fixed. The com- 
putation of the linear image uj(E) in step la takes 0{dnm) = 0{nm) operations. By Lemma 
12. H the number of vertices of the zonotope Z defined in step 2 satisfies k = 0{m d ~ l ), and the 
computation of these vertices u l and of corresponding linear functionals a 1 in steps 2a and 2b 
requires (^(m^ -1 ) operations by Lemma 12.21 The number of queries in step 3b of the oracle of 
T and in step 3c of the oracle of c are k = 0(m d_1 ) as claimed. The computation of each b % 
in step 3a and of each w(F l ) in step 3c take 0(dn) = 0(n) operations totalling together over 
all i to O(kn) = 0(nm d_1 ) arithmetic operations. Finally, the arithmetic complexity of finding 
the maximum among the k values c(w(F 1 )) in step 4 is 0(k) = 0(m a! ~ 1 ). Thus, the dominant 
arithmetic complexity is Ofo-m^ 1 ) as claimed. □ 
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As the proof shows, convex combinatorial optimization is solved by enumerating all vertices 
of the polytope and picking the best. While each vertex of is the image under w of some 
vector lp, with F € F, the difficulty is that the number of sets in F is typically exponential in n 
and hence it is generally impossible to construct directly in polynomial arithmetic complexity 
(in particular, each lp is a vertex of V^). The efficient construction of V£j is made possible by 
the given set of edge-directions of and by proceeding, indirectly, through the zonotope Z that 
refines . While the number of vertices of Z can be much larger than that of V^, the vertices of 
Z can be better controlled and this leads to the polynomial complexity bound. So if the polytope 

of a family F admits a relatively small set containing a direction of each edge which can be 
efficiently constructed or even characterized, then the problem is efficiently reducible. 

2.3 Generic solution of the linear combinatorial optimization counterparts 

We now discuss how to realize an oracle that will repeatedly solve each of the linear combinatorial 
optimization counterparts queried upon in Algorithm 12.51 As before, we assume that our family 
F is edge-guaranteed and hence nonempty and comes with an explicit set E = {e 1 , . . . , e m } 
containing a direction of each edge of V* ' ■ We assume moreover that the family comes with one 
member F° € F to start with. We consider the following three oracle presentations of F . 

• Membership oracle: when queried about F C N, this oracle asserts whether or not F £ J-. 

• Augmentation oracle: when queried about F £ F and b : N — ► Q, this oracle returns a 
family member F £ F with b(F) > b(F) or asserts that F has maximum weight in T. 

• Linear combinatorial optimization oracle: when queried about b : N — ► Q, this oracle 
returns a family member F £ F of maximum weight b(F). 

Lemma 2.7 For any edge- guaranteed family, a membership oracle enables to simulate an aug- 
mentation oracle in strongly polynomial oracle time. 

Proof. Without loss of generality, assume that each e* is a {—1,0, l}-vector. The simulation 
is simple. Consider a query about F and b : N — ► Q. Call an edge-direction e l improving 
if b(e l ) = X^j=i Ki) e j' > 0; can ^ admissible at F if lp + e l is a {0, l}-vector whose support 
F l := supp(±F + e l ) is in T ' . If there is an edge-direction e % which is both improving and ad- 
missible then return F l \ otherwise assert that F has maximum weight in F . The simulation 
works correctly since, as is well known, a vertex u is not a maximizer of a linear functional b 
over a polytope if and only if the polytope has an edge [u, v] for some vertex v with b(v) > b(u). □ 

The next lemma is from [241 145j : see |44| for the state of the art on this line of research. It 
involves a computationally heavy Diophantine approximation step [201 an d a scaling step [TH] . 
We include an outline of the proof, which is relevant for the discussion in the next subsection. 

Lemma 2.8 For any family, an augmentation oracle enables to simulate a linear combinatorial 
optimization oracle in strongly polynomial oracle time. 
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Proof. We outline the simulation. Consider query about b : N — > Q. The Diophantine 
approximation step (see |2UI Theorem 3.3]) replaces b by a : N — ► Z with the following two 
properties: first, it is equivalent to b in that, for any pair F,G C JV, it satisfies a{F) < a{G) if 
and only b(F) < b(G); and second, the maximum number of bits k := 1 + max Jg Ar[log |a(j)|] in 
the binary representation of the weight under a of any element is polynomial in n. 

The scaling step (inspired by |19j ) is the following. Applying a simple transformation (see 
|45| ) we may assume a is nonnegative. Following the prof of |24l Theorem 9.2], for i = 0, . . . , k, 
starting with F l G J 7 , find a maximizing F l+1 G T with respect to the weighting a 1 := [2 l ~ k a\. 
As shown in [21] or [12] j each ,F* +1 is obtained from F % by calling the augmentation oracle at 
most n times. Thus, the maximizer F k+1 of a = a k is found using at most kn calls, and since k 
is polynomial in n, the desired maximizer of b is obtained in strongly polynomial oracle time. □ 

We can now prove Theorem 11.11 As discussed before, the family T C 2 N comes with one 
explicit F° G T and an explicit set E = {e 1 , . . . , e m } C Q n containing a direction of each edge of 
V^, and is presented by a membership oracle. The complexity is measured in terms of n and m. 

Theorem II. II For any fixed d, there is a strongly polynomial oracle time algorithm solving convex 
combinatorial optimization over any edge- guaranteed family presented by a membership oracle. 

Proof. Theorem 12.61 guarantees that Algorithm 12.51 solves the convex combinatorial optimiza- 
tion problem over J- efficiently using a linear combinatorial optimization oracle which, by Lemmas 
12.71 and 12.81 can be efficiently simulated from the membership oracle presenting T. □ 

We conclude this subsection with several important remarks which lead to the discussion 
in the next subsection. First, the complexity behind Theorem 11.11 is quite horrendous: for 
each linear combinatorial optimization counterpart invoked by Algorithm 12.51 an application of 
the Diophantine approximation step which takes 0(n 8 ) arithmetic operations [20] is required. 
However, improved complexity bounds follow from Theorem 12 . 61 when Algorithm 12 .51 is used with 
a more efficient linear combinatorial optimization oracle whenever a particular family admits one. 

Second, what about real data and real arithmetic computation (where pairs of real numbers 
can be added, multiplied or compared in unit time)? Algorithm 12 . 51 remains valid and polynomial 
and the analog of Theorem l2.6l fwith "strongly polynomial oracle time" replaced by "polynomially 
many real arithmetic operations and queries") holds. So does the conversion of the membership 
oracle to the augmentation oracle manifested by Lemma 12.71 above. However, the proof of both 
parts of Lemma 12.81 (scaling and Diophantine approximation) breaks down for real data, and the 
conversion of the augmentation oracle to the optimization oracle is no longer available. 

Can these obstacles be waved and does the real analog of Theorem 11.11 remain valid? Our 
families are edge-guaranteed, which is stronger than having a test set (see |46j and references 
therein) and even more so than having a mere augmentation oracle: can we take advantage 
of that and simulate standard linear combinatorial optimization directly and more efficiently, 
avoiding scaling and Diophantine approximation? We discuss some of these issues next. 
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2.4 On the Hirsch conjecture and the Klee-Minty problem for (0, l)-polytopes 

A form of the Hirsch conjecture, open to date, asks whether the diameter of (the graph of) every 
n-polytope with / facets is bounded above by a polynomial in n and /; for all one knows, the 
linear upper bound f—n may suffice, see [321134] . Also open is the analogous form of the monotone 
Hirsch conjecture asking whether the shortest increasing path under any linear functional from 
any vertex to some maximizing vertex is polynomially bounded in n and /. Both variants are 
true for (0, l)-polytopes [37] as well as for some more general classes of integer polytopes [THIHH] . 
The following slightly stronger form, relevant to the discussion below, holds. 

Lemma 2.9 Any (0,1) n-polytope P admits, under any linear functional, a nondecreasing path 
from any vertex u to any maximizing vertex w, of length at most n using no edge- direction twice. 

Proof. The claim being trivial for n = 0, we proceed by induction. If u and w lie on a common 
proper face of P then induction takes over. Otherwise, there is a nondecreasing arc (u,v); pick 
any % with m ^ vf, then Vi = W{ and hence, by induction, there is a nondecreasing (t>,u>)-path of 
length at most n — 1 using no edge-direction twice on the face F := {x € P : X{ = w{\. As no edge 
of F can have direction v — u, this path preceded by the arc (u, v) gives the desired (u, w)-path. □ 

The effective Hirsch conjecture asks, broadly, whether a monotone path could be efficiently 
traced. To make things precise, the presentation of the polytope has to be specified. For instance, 
tracing such a path in strongly polynomial time for polytopes presented by linear inequalities 
would imply a strongly polynomial time algorithm for linear programming via the simplex method 
which does not seem likely; but tracing it in sub exponential time is possible [21]. A natural 
question is: how long can an arbitrary increasing path be? A classical construction by Klee 
and Minty [SHI transforms the n-cube so as to admit increasing paths of exponential length 2 n . 
But (0, l)-polytopes are very special, as shows Lemma 12.91 How long, then, can an arbitrary 
increasing path in a (0, l)-polytope be? We now show that, unfortunately, (0, l)-polytopes admit 
such paths of length exponential in the dimension as well: in this sense, the following lemma can 
be regarded as a (0, 1) analog of the Klee-Minty cube; we thank Tal Raviv for a related discussion. 

Lemma 2.10 For every n there is a (0, l)-polytope of dimension less than |n 4 with n! vertices 
that admits a Hamiltonian (and hence n\-long) nondecreasing path under every linear functional. 

Proof. The Young polytope Y n -2.2 is the convex hull of all (2) x (2) matrices of permutations 
of edges of the complete graph K n induced by the n\ permutations of its vertices. For instance, 
the matrix corresponding to the permutation of vertices a = (1, 2, 3, 4) (in cycle notation) is 



S = 
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The claims about the dimension and number of vertices of Y n —2,2 are obvious. In jHH] it was 
shown that 1^-2,2 is 2-neighborly, that is, its graph is the complete graph K n \ : the very existence 
of 2-neighborly (0, l)-polytopes is an amazing fact in itself! It follows that if a is any linear 
functional, then any ordering v\,... ,v n \ of the vertices of 1^-2,2 satisfying a(v\) < • • • < a(v n \) 
gives a Hamiltonian (and hence exponentially long) nondecreasing path under a. □ 

Here, however, we are especially interested in the polytopes of edge- guaranteed families. 
Lemmas 12 . 71 and 12 . 81 imply that for such a family, a monotone path can be traced in time strongly 
polynomial in m and n, alas, for rational functionals only, and using the heavy Diophantine ap- 
proximation procedure. Can we do better? what is the maximal length I(n, m) of any increasing 
path in any n-dimensional (0, l)-polytope with m pairwise nonproportional edge-directions? 

2.5 Edge- well-behaved classes 

In most applications, in particular all of those discussed in Section 3, one is concerned with a 
class of families possessing some unifying structure. It is therefore useful to make some formal 
definitions regarding such classes and then use it to obtain a suitable corollary of Theorem 11.11 
For n > let N = {1, . . . , n} as before and let U n be the set of all families with ground set N, 

U n := 2 2{1 ""' n} = {T : TC2 N }. 

A class of families is a (typically infinite) set of families C = \+j n >o^ n w ^ n Cn Q U n for all n. 

Definition 2.11 A class C is edge-well-behaved if there is a polynomial time algorithm that, 
given n, produces a set E n = {e 1 , . . . , e m ^} C {—1,0, l} n with respect to which every T € C n is 
edge- guaranteed. In particular, m(n) is polynomial in n and each T € C is nonempty. 

While the existence of such a "uniform" polynomial time algorithm that produces sets contain- 
ing edge-directions for the polytopes of all families in the class may seem a strong assump- 
tion, we will see in Section 3 that in many applications such an algorithm is readily avail- 
able. Also, the assumption that the edge-directions are (—1,0, l)-valued is not restrictive since, 
for (0, l)-polytopes, each edge is a difference of two vertices hence admits a (—1,0, Indirection. 
The next corollary follows at once from Theorem II .11 here the complexity is in terms of n only. 

Corollary 2.12 Fix any d. Then for any edge-well-behaved class there is a strongly polynomial 
oracle time algorithm that solves the convex combinatorial optimization problem over any family 
in the class which is presented by a membership oracle. 

While this statement may seem a reformulation of Theorem 11.11 it is natural and useful in 
uniformly establishing the polynomial solvability in all of the applications discussed in Section 3. 

2.6 Projection representation and circuits 

We conclude Section 2 by discussing a useful setup that helps in controlling edge-directions, and 
which is used and demonstrated in the application given in §3.2 in the sequel. A circuit of an r x s 
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matrix A is a nonzero solution z E Q s of the system Az = whose support is inclusion-minimal. 
It is known (cf. |431 Exercise 10.14]) that any nonzero solution of Az = has a conformal circuit 
decomposition, i.e. can be expressed as z = Yli a % z% with the on positive scalars and the z % 
pairwise nonproportional circuits such that z^Zj > for all i and all j E supp(z*). Consider the 
standard polytope V = {x E Q s : Ax = 6, I < x < u} defined by A, right-hand side b E Q r , 
lower bound I G Q s , and upper bound a £ Q s . We provide a short proof of the following useful 
property of edge-directions of the standard polytope (see [30] for a refinement of this property 
which characterizes edge-directions). 

Lemma 2.13 Each edge- direction of a standard V is a circuit of its defining matrix A. 

Proof. Consider any x, y E V. Then A(y — x) = so y — x admits a conformal circuit decom- 
position y — x = Yli a i zt - It i s then not hard to verify that for every circuit z 1 participating in 
that decomposition, both x + onz 1 and y — aiz 1 satisfy the lower and upper bounds and hence are 
in V. They belong, moreover, to any face F containing both x and y. Indeed, pick any c in the 
normal cone C^: then c • x > c ■ (x + a^z 1 ) which implies c • z % < 0, and c • y > c • (y — aiz % ) which 
implies c • z % > 0. It follows that c • z l = and hence c • {x + oiiZ 1 ) = c ■ x and c • (y — ctiZ 1 ) = c • y 
implying x + a^z 1 ,y — ctiZ 1 G F. Now, if the decomposition y — x = Y^i a i z% involves more 
than one circuit, say z 1 ^ 2 , then any face containing x,y contains the three non-collinear points 
x, x + a\z l ,x + ct2Z 2 and hence is not an edge. So if [x, y] is an edge of V then y — x = az for 
some circuit z. Any direction of that edge is a nonzero multiple of z and hence a circuit of A. □ 

Lemma 12 . 1 31 implies that any inclusion-maximal set Z of pairwise nonproportional circuits of 
A contains a direction of each edge of V . So if the size of A is r x s then V admits such a set Z 
with no more than ( s ) elements. Every polytope Q is the linear image of a standard polytope: 
if Q = {x : Bx < b} is a description by inequalities then, adding a suitable "slack" vector y, we 
get Q = <p(V) with V = {(x,y) : Bx + Iy = b, y > 0} and with cf> the u y forgetting" projection 
4>(x, y) = x. In particular, the polytope of any family T is the linear image V T = <f>(P) of 
a standard polytope V = {x G Q s : Ax = b, I < x < u}. Typically the number of inequalities 
describing is exponentially large and hence so is the dimension of V, but when V has small 
dimension, we can benefit from such a "projection representation" in two ways as follows. 

First, if the defining matrix A admits an efficiently determinable set Z = {z 1 , . . . , z m } C Q s 
containing a scalar multiple of each circuit of A then, by Lemmas l2.4l and l2.13l its image E := <f>(Z) 
contains a direction of each edge of , making T an edge-guaranteed family. Second, linear 
combinatorial optimization over T can be "lifted" to linear programming over the polytope V, 
giving a way alternative to §2.3 for solving the counterparts called upon by Algorithm 12.51 

3 Some Applications 

3.1 Some direct applications 

Here we give two examples where the set of edge-directions can be directly determined and used. 
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Example 3.1 Positive semidefinite quadratic assignment. The quadratic assignment 
problem is the following: given a real rex n matrix M, find x € {0, 1}™ maximizing the quadratic 
form x T Mx induced by M; see |42j for an overview of this problem and its applications. We 
consider the instance where M is positive semidefinite, in which case it can be assumed to be 
presented as M = W T W with W a given d x n matrix. If the rank d of W and M is variable 
then this problem is NP-hard [22j. For fixed d it is polynomial time solvable [2j. 

When W is rational, the problem can be modelled as convex combinatorial optimization 
with the following data: the family is the entire power set F = 2 N of N with the natural 
correspondence F <-> {0, 1}"; the weight of j € N is the jth column w(j) := W 3 of the matrix 
W; and c : Q d — ► Q : x i— ► ||x|| 2 is the squared standard 1% norm. Indeed, for each F £ J- we 
then have 1 T F W T W1 F = c(w(F)). 

Now, the polytope of the family T here is just the n-cube = [0, l] n ; therefore the trivially 
computable set of n standard unit vectors E := {li, . . . , l n } contains a direction of each edge. 
Thus, the class of all such families is edge-well-behaved with m(n) = n and Corollarv l2. 121 applies 
and guarantees the efficient solution. Here, one obtains a faster solution by using Algorithm 12.51 
together with a linear combinatorial optimization oracle realized by simple sign checking as 
follows: given b : N — > Q, a member F € J- maximizing b(F) is simply F := {j : b(j) > 0}. 

Example 3.2 Convex matroid optimization. This problem is the special case of convex 
combinatorial optimization where T is either the collection B of bases (considered in [22] ) or the 
collection X of independent sets of a matroid over N. It generalizes classical matroid optimization, 
first studied in ^H], and has a rich modelling power on its own: useful matroids include the forest 
matroid of a graph and, more generally, the matroid of linear dependencies of a matrix over a 
field. For us here it suffices that the matroid is presented by a membership oracle for J- . 

It can be derived from the matroid-bases-axioms that the trivially computable set D := 
{lj — lj : 1 < i < j < n} of (g) differences of unit vectors contains a direction of each edge of the 
polytope V s of the family B of bases. Likewise, it can be derived from the matroid-independence- 
axioms that the (™t ^-element union D U E of D and the set E := {li, . . . , l n } of unit vectors 
contains a direction of each edge of the polytope V 1 of the family X of independent sets. Thus, 
the class of all such families B (respectively, families I) is edge-well-behaved with m(n) = Q) 
(respectively, m(n) = ("2^)), and Corollary 12.121 applies and guarantees the efficient solution. 

Here too, one obtains a faster solution by using Algorithm 12 . 51 together with a linear combina- 
torial optimization oracle over T = B or T = F which is efficiently realizable from a membership 
oracle for T using the classical greedy algorithm (cf. |131 118j ) that, given b : N — > Q makes 
use of sorting the values b(j) to find the lexicographically 6-largest member F € F which can be 
shown to be the one maximizing b(F). 

3.2 Shaped vector partitioning 

The shaped partition problem concerns the partitioning of a multiset V = {v 1 , . . . , v n } of n vectors 
in (f-space into p parts so as to maximize an objective function which is convex on the sum of 
vectors in each part, subject to constraints on the number of elements in each part. To describe 
the problem precisely we need some notation. A p-partition of the index set {1, . . . , n} of V is an 
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ordered tuple it = (tti, . . . , ir p ) of pairwise disjoint sets whose union is {1, . . . , re}. The shape of 
a partition is the tuple of cardinalities of its parts, |tt| := (|ttx|, . . . , |7rJ). In addition to the set 
of vectors V, the data includes vectors I, u € {0, 1 . . . , n} p with I < u providing lower and upper 
bounds on the shape of admissible partitions. With each partition ir is associated a d x p matrix 




whose jth column is the sum (representing the "total value") of vectors assigned to the jth part. 
The data also includes a convex functional c : Q dxp — ► Q on d x p matrices which "weighs 
together" the sums of vectors in the various parts. The problem is to find a p-partition ir whose 
shape satisfies the lower and upper bounds I < < u and which maximizes the value c(V r ). 

Shaped partition problems have applications in diverse fields such as clustering, inventory, 
reliability, and more - see [3 11UI 1121 1291 1311 142j and references therein. Here is a typical example. 

Example 3.3 Minimal variance clustering. This is the following problem, which has numer- 
ous applications in the analysis of statistical data: given n sample points v , . . . ,v n in <i-space, 
group the points into p clusters ix\ , . . . , ir p so as to minimize the sum of cluster variances 

E^£f-(^£»->ii 2 - 

j = l J i^TTj J i£irj 

We consider the instance where there are n = p ■ m points and the clustering sought is balanced, 
that is, the clusters should have equal size m. Suitable manipulation of the sum of variances 
shows that the problem is equivalent to a shaped partition problem with the lower and upper 
bounds I := u := (m, . . . , m) (forcing the single shape \ir\ = (m, . . . , m) on partitions), and with 
the convex functional (to be maximized) simply as the square of the I2 norm on d x p matrices, 
given by c : Q dx P — ► Q : X h-> ||X|| 2 = ^ti £;=i l^jf- 

If either the dimension d or the number of parts p is variable, the shaped partition prob- 
lem instantly captures NP-hard problems hence is presumably intractable [201 • Therefore, it is 
interesting to study the worst case arithmetic complexity in terms of the number re of points 
with both d,p fixed. In the special case where there are no shape restrictions (partitions of all 
shapes are admissible), an upper bound of 0{n d ^ p ~'^~ 1 ) on the complexity is given in 41 and a 
quite compatible lower bound of Vl{ri L ~^~^ p ) is in 5 . In the more general case where arbitrary 
sets of shapes are allowed, the best upper bound to date is 0(n dp ) from |29*] : while a matching 
lower bound is unknown, the lower bound 0{n d ^) from 3 on the related number of separable 
partitions indicates that the quadratic term p 2 in the exponent may be unavoidable. 

We now show how to solve the shaped partition problem efficiently using our framework. We 
begin by modelling it as a convex combinatorial optimization problem. The ground set is taken to 
be N := {(i,j) ■ 1 <i < n, 1 < j < p}. Each p-partition tt = (tti, . . . ,tt p ) is encoded as the set 
F-k := {(i,j) ■ i € 7Tj} Q N. The family consists of all such sets corresponding to ^-partitions of 
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admissible shapes, T := {F n : I < \ir\ < u}. The weight of element (i, j) G N is the d x p matrix 
w(i,j) := v l lj whose jth column is v % and whose other columns are zero. Finally, the convex 
functional is simply the given one c defined on d x p matrix space. It is not hard to verify that 
this indeed casts the shaped partition problem as a convex combinatorial optimization problem 
with a ground set of size \N\ = np and weight vectors (matrices) in dimension dp. 

To show that the class of all such families is edge- well-behaved we discuss the family polytope 
C Q nxp . The indicator of a family member F w £ T is the (0, l)-valued n x p matrix l w 
whose (i, j)th entry equals 1 precisely when i G ttj. The polytope admits a simple projection- 
representation as follows. Consider {n + 1) x p matrices whose rows are indexed by {0, 1, . . . , n}. 
Define lower and upper bound matrices L, U in terms of the given vectors l,u as follows: for 
j = 1, . . . ,p set Lij := 0, Uij := 1 if 1 < i < n and Lqj := n — Uj, Uqj := n — lj. Let V be the 
transportation polytope defined by this data, which is the following standard polytope 

{p n 
X £ Q( n + l )*P : ^Xij = 1 (1 < i < n), = n (1 < j <p), L<X <U 

j=l i=0 

Then V T = <f>(V) with <\> : Q( n+1 ) x P — > Q nx P the projection erasing the 0th row of a matrix. 

Let K n+ i^ p be the complete bipartite graph with edge set {(i,j) ■ < i < n, 1 < j < p} 
corresponding to this transportation system. Each circuit of K n+ \ )P gives an (n+1) xp matrix sup- 
ported on that circuit, with values ±1 alternating along the edges of the circuit and elsewhere. 
It is well known that each circuit of the (n+p) x (n + l)p matrix of coefficients of the equation sys- 
tem defining V is proportional to some such circuit-supported matrix. Let Z := {z , . . . , z m } be 
the set of all such (n + 1) xp matrices corresponding to them := X^f=2^(?) ("T 1 )*'^ - -0' distinct 
circuits of K n+ i tP . Then the projection E = {e 1 ,...^" 1 } := (p(Z) is a set of (— 1, 0, l)-valued 
n x p matrices which, as explained in §2.6, contains a direction of each edge of . 

Since p is assumed to be fixed, the set of circuits Z and its projection E are computable 
in time polynomial in n, and therefore the class of all such families is indeed edge- well-behaved 
with m(n) = Yli=2 Ki'X"*!' 1 )^^ — = 0(n p ). Thus, Corollary 12 . 1 21 applies and guarantees the 
efficient solution. As in the examples in §3.1, here too one can obtain a typically faster solution 
by using Algorithm 12 . 51 together with a linear combinatorial optimization oracle realized by lifting 
to V and solving the corresponding transportation problem using available fast algorithms for 
bipartite network flows, see pQ. These algorithms, however, are not strongly polynomial and do 
depend on the bit size of the bounds l,u and the costs b l called repeatedly by Algorithm 12.51 

For unrestricted partitioning, that is, shaped partitioning with I = (0, . . . , 0) and u = (n, . . . , n), 
the characterization of circuits obtained in [301 shows that circuits of K n+ \^ which yield circuits 
of the matrix of coefficients of the equation system defining V correspond to switching a single 
item from one part to another. As the number of such circuits is 71(2) , the class of such families is 
edge- well-behaved with (improved) m(n) = nffl. Thus, while Theorem 12.61 with m(n) = 0(n p ) 
and dimension dp implies a complexity bound of 

n o(d P 2 ) Qn 

the general shaped partition problem, 

in line with (22], with m(n) = n^) and same dimension dp it implies the improved bound of 
n O(d P ) on 

the complexity of the unrestricted partition problem, in line with [4*T] . 
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4 Concluding Remarks 

In this article we have defined the convex combinatorial optimization problem and shown that it 
can be solved in strongly polynomial time for edge-guaranteed families and for edge-well-behaved 
classes of families. We have demonstrated several natural and broad applications that indeed give 
rise to edge-well-behaved classes and therefore are efficiently solvable through our framework. 

The polynomial time solvability of linear combinatorial optimization for facet-well-behaved 
classes via the Ellipsoid method |25[ \',VA\ has stimulated over the years a broad body of work on 
the identification and characterization of such classes. A major research program called upon 
by this paper is an analogous identification and characterization of edge-well-behaved classes of 
combinatorial families, for which our framework automatically yields strongly polynomial time 
solvability of convex combinatorial optimization. 

Some more specific questions are discussed within the body of our paper, in particular, those 
in §2.3 and §2.4 concerning a more efficient generic solution of standard linear combinatorial 
optimization over edge-guaranteed families and the effective Hirsch conjecture for polytopes of 
such families. What is the maximal length I(n, to) of any increasing path in any n-dimensional 
(0, l)-polytope with to pairwise nonproportional edge-directions? Can we trace such a path 
efficiently while avoiding scaling and the heavy Diophantine approximation procedure? Can we 
trace such a path in polynomially many real arithmetic operations for real linear functionals? 

For solving the standard linear counterparts of a convex combinatorial optimization problem 
over a family J- with weighting w, one approach may be to try and augment along edges of the 
polytope downstairs (see discussion following the proof of Theorem 12 .6j) . While a set of edge- 
directions of that polytope is available as the projection co(E) of the set of edge-directions of , 
this information is not enough: one needs to know, at any vertex v of , which edge-directions 
u{e l ) are admissible at v, and moreover - "how much to walk" - namely, what is the nonnegative 
scalar a such that v + ae % is the new vertex to move to. When this information is available, 
even in an abstract setting of a suitably defined neighborhood oracle, it may be possible to apply 
the vertex enumeration methodology of 0; and in some applications, such as unrestricted vector 
partitioning, this can be carried out particularly efficiently as in |21| . 

Finally, as mentioned in the introduction, there is much room for the study of approximation 
algorithms for the often intractable convex combinatorial optimization problem for various classes 
of families, and we hope this article will stimulate research on this yet unexplored ground. 
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